🎮 Reinforcement Learning - emmmmdty · Scour

Show HN: Fighting the War Against Expensive Reinforcement Learning

cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app·4h·

Discuss: Hacker News

Rollout-Training Co-Design for Efficient LLM-Based Multi-Agent Reinforcement Learning

arxiv.org·1d

Homing through Reinforcement Learning

arxiv.org·2d

check out this article on Reinforcement Learning with R: Origins, Real-Life Applications, and Practical Implementation

dev.to·2d·

Discuss: DEV

🔄Transformers

A multi-agent reinforcement learning approach to autonomous aircraft taxiing with taxiing time, fuel consumption, and emission optimization

sciencedirect.com·22h

Multi AI Agent Systems with crewAI

deeplearning.ai·38m

A training principle for drifting models

breno.bearblog.dev·45m

🤖Machine Learning

Feedback Control for Computer Systems

janert.org·4h

Observe emergent behavior in autonomous multi-agent LLM networks

agents.glide2.app·1d·

Discuss: Hacker News

AI Agents Explained in 3 Levels of Difficulty

kdnuggets.com·1d·

Discuss: Hacker News

Robotics Motion Learning: Training Linked Robot Arms with Kuramoto Models

hackernoon.com·20h

GLM-5: From Vibe Coding to Agentic Engineering

simonwillison.net·17h·

Discuss: Hacker News

Why the future of AI belongs to models that simulate reality

sifted.eu·2h

JupyterPS/VBAF: Visual Business Automation Framework - PowerShell-based reinforcement learning for education and business automation

github.com·1d·

Discuss: Hacker News

Recursive self-improvement from AI models

marginalrevolution.com·1d·

Discuss: Hacker News

I Pitted 3 AI Agents Against Each Other. The Result Was Scary.

pub.towardsai.net

·22h

FinovateEurope 2026: From AI Hype To Bank‑Ready Execution

forrester.com·1h

🤖Machine Learning

Task-Completion Time Horizons of Frontier AI Models

metr.org·19h·

Discuss: Hacker News

Palantir: N Of 1, Industrializing Autonomy Via Zero-Marginal-Cost AI Integration

seekingalpha.com

·20h

New Research Shows AI Agents Learn Altruism From Human Behavior

pymnts.com·2d

Loading more...